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Abstract 

For input x, let F(x) denote the set of outputs that are the "legal" answers for a computational 
problem F. Suppose x and members of F{x) are so large that there is not time to read them in their 
entirety. We propose a model of local computation algorithms which for a given input x, support queries 
by a user to values of specified locations j/j in a legal output y 6 F(x). When more than one legal output 
y exists for a given x, the local computation algorithm should output in a way that is consistent with at 
least one such y. Local computation algorithms are intended to distill the common features of several 
concepts that have appeared in various algorithmic subfields, including local distributed computation, 
local algorithms, locally decodable codes, and local reconstruction. 

We develop a technique, based on known constructions of small sample spaces of fc-wise independent 
random variables and Beck's analysis in his algorithmic approach to the Lovasz Local Lemma, which 
under certain conditions can be applied to construct local computation algorithms that run in polyloga- 
rithmic time and space. We apply this technique to maximal independent set computations, scheduling 
radio network broadcasts, hypergraph coloring and satisfying fc-SAT formulas. 
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1 Introduction 



Classical models of algorithmic analysis assume that the algorithm reads an input, performs a computation 
and writes the answer to an output tape. On massive data sets, such computations may not be feasible, as both 
the input and output may be too large for a single processor to process. Approaches to such situations range 
from proposing alternative models of computation, such as parallel and distributed computation, to requiring 
that the computations achieve only approximate answers or other weaker guarantees, as in sublinear time 
and space algorithms. 

In this work, we consider the scenario in which only specific parts of the output y = (y\, . . . , y m ) are 
needed at any point in time. For example, if y is a description of a maximal independent set (MIS) of a 
graph such that yi = 1 if i is in the MIS and y-i = otherwise, then it may be sufficient to determine 
yi x , . . . , yi k for a small number of k vertices. To this end, we define a local computation algorithm, which 
supports queries of the form "yi =?" such that after each query by the user to a specified location i, the local 
computation algorithm is able to quickly outputs yi. For problems that allow for more than one possible 
output for an input x, as is often the case in combinatorial search problems, the local computation algorithm 
must answer in such a way that its current answer to the queiy is consistent with any past or future answers 
(in particular, there must always be at least one allowable output that is consistent with the answers of the 
local computation algorithm ). For a given problem, the hope is that the complexity of a local computation 
algorithm is proportional to the amount of the solution that is requested by the user. Local computation 
algorithms are especially adapted to computations of good solutions of combinatorial search problems (cf. 

EO M 123). 

Local computation algorithms are a formalization that describes concepts that are ubiquitous in the lit- 
erature, and arise in various algorithmic subfields, including local distributed computation, local algorithms, 
locally decodable codes and local reconstruction models. We subsequently describe the relation between 
these models in more detail, but for now, it suffices it to say that the aforementioned models are diverse in 
the computations that they apply to (for example, whether they apply to function computations, search prob- 
lems or approximation problems), the access to data that is allowed (for example, distributed computation 
models and other local algorithms models often specify that the input is a graph in adjacency list format), 
and the required running time bounds (whether the computation time is required to be independent of the 
problem size, or whether it is allowed to have computation time that depends on the problem size but has 
sublinear complexity). The model of local computation algorithms described here attempts to distill the 
essential common features of the aforementioned concepts into a unified paradigm. 

After formalizing our model, we develop a technique that is specifically suitable for constructing poly- 
logarithmic time local computation algorithms. This technique is based on Beck's analysis in his algorithmic 
approach to the Lovasz Local Lemma (LLL) ifTTTl . and uses the underlying locality of the problem to con- 
struct a solution. All of our constructions must process the data somewhat so that Beck's analysis applies, 
and we use two main methods of performing this processing. 

For maximal independent set computations and scheduling radio network broadcasts, we use a reduction 
of Parnas and Ron ll36l . which shows that, given a distributed network in which the underlying graph has 
degree bounded by a constant D, and a distributed algorithm (which may be randomized and only guaranteed 
to output an approximate answer) using at most t rounds, the output of any specified node of the distributed 
network can be simulated by a single processor with query access to the input with 0{D t+1 ) queries. For 
our first phase, we apply this reduction to a constant number of rounds of Luby's maximal independent set 
distributed algorithm ll29l in order to find a partial solution. After a large independent set is found, in the 
second phase, we use the techniques of Beck ifTTI to show 
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For hypergraph coloring and fe-SAT, we show that Alon's parallel algorithm, which is guaranteed to find 
solutions by the Lovasz Local Lemma 0, can be modified to run in polylogarithmic sequential time to 
answer queries regarding the coloring of a given node, or the setting of a given variable. For most of the 
queries, this is done by simulating the work of a processor and its neighbors within a small constant radius. 
For the remainder of the queries, we use Beck's analysis to show that the queries can be solved via the brute 
force algorithm on very small subproblems. 

Note that parallel 0(log n) time algorithms do not directly yield local algorithms under the reduction of 
Pamas and Ron (361 . Thus, we do not know how to apply our techniques to all problems with fast parallel 
algorithms, including certain problems in the work of Alon whose solutions are guaranteed by the Lovasz 
Local Lemma and which have fast parallel algorithms [2]. Recently Moser and Tardos (321I331 gave, under 
some slight restrictions, parallel algorithms which find solutions of all problems for which the existence of 
a solution is guaranteed by the Lovasz Local Lemma (TT1 I21. It has not yet been resolved whether one can 
construct local computation algorithms based on these more powerful algorithms. 

1.1 Related work 

Our focus on local computation algorithms is inspired by many existing works, which explicitly or implicitly 
construct such procedures. These results occur in a number of varied settings, including distributed systems, 
coding theory, and sublinear time algorithms. 

Local computation algorithms are a generalization of local algorithms, which for graph theoretic prob- 
lems represented in adjacency list format, produce a solution by adaptively examining only a constant sized 
portion of the input graph near a specified vertex. Such algorithms have received much attention in the dis- 
tributed computing literature under a somewhat different model, in which the number of rounds is bounded 
to constant time, but the computation is performed by all of the processors in the distributed network If34]l3l"l . 
Naor and Stockmeyer (34l and Mayer, Naor and Stockmeyer lETO investigate the question of what can be 
computed under these constraints, and show that there are nontrivial problems with such algorithms. Several 
more recent works investigate local algorithms for various problems, including coloring, maximal indepen- 
dent set, dominating set (some examples are in (25] HU E5 |27] |23] |22l |39l |U \M IH). Although all of these 
algorithms are distributed algorithms, those that use constant rounds yield (sequential) local computation 
algorithms via the previously mentioned reduction of Pamas and Ron (3610 

There has been much recent interest among the sublinear time algorithms community in devising local 
algorithms for problems on constant degree graphs and other sparse optimization problems. The goals of 
these algorithms have been to approximate quantities such as the optimal vertex cover, maximal matching, 
maximum matching, dominating set, sparse set cover, sparse packing and cover problems If36l |26] 1301 1351 
[T8l l42l . One feature of these algorithms is that they show how to construct an oracle which for each 
vertex returns whether it part of the solution whose size is being approximated - for example, whether it 
is in the vertex cover or maximal matching. Their results show that this oracle can be implemented in 
time independent of the size of the graph (depending only on the maximum degree and the approximation 
parameter). However, because their goal is only to compute an approximation of their quantity, they can 
afford to err on a small fraction of their local computations. Thus, their oracle implementations give local 
computation algorithms for finding relaxed solutions to the the optimization problems that they are designed 
for. For example, constructing a local computation algorithm using the oracle designed for estimating the 
size of a maximal independent set in (301 yields a large independent set, but not necessarily a maximal 

'Note that the parallel algorithms of Luby 1291 . Alon (2] and Moser and Tardos 1331 do not automatically yield local algorithms 
via this transformation since their parallel running times are t — Q(logn). 
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independent set. Constructing a local computation algorithm using the oracle designed for estimating the 
size of the vertex cover ll36l l30l l35l yields a vertex cover whose size is guaranteed to be only slightly larger 
than what is given by the 2-approximate algorithm being emulated - namely, by a multiplicative factor of at 
most 2 + 5 (for any 5 > 0). 

Recently, local algorithms have been demonstrated to be applicable for computations on the web graph. 
In Ifl9l[l2l [381 181171. local algorithms are given which, for a given vertex v in the web graph, computes an 
approximation to u's personalized PageRank vector and computes the vertices that contribute significantly 
to u's PageRank. In these algorithms, evaluations are made only to the nearby neighborhood of v, so 
that the running time depends on the accuracy parameters input to the algorithm, but there is no running 
time dependence on the size of the web-graph. Local graph partitioning algorithms have been presented in 
BT1 [H which find subsets of vertices whose internal connections are significantly richer than their external 
connections. The running time of these algorithms depends on the size of the cluster that is output, which 
can be much smaller than the size of the entire graph. 

Though most of the previous examples are for sparse graphs or other problems which have some sort of 
sparsity, local computation algorithms have also been provided for problems on dense graphs. The property 
testing algorithms of lfT7l use a small sample of the vertices (a type of a core-set) to define a good graph 
coloring or partition of a dense graph. This approach yields local computation algorithms for finding a large 
partition of the graph and a coloring of the vertices which has relatively few edge violations. 

The applicability of local computation algorithms is not restricted to combinatorial problems. One 
algebraic paradigm that has local computation algorithms is that of locally decodable codes ll2ll . described 
by the following scenario: Suppose m is a string with encoding y = E{m). On input x, which is close in 
Hamming distance to y, the goal of locally decodable coding algorithms is to provide quick access to the 
requested bits of m. More generally, the reconstruction models described in |Q~l[T3][37j describe scenarios 
where a string that has a certain property, such as monotonicity, is assumed to be corrupted at a relatively 
small number of locations. Let P be the set of strings that have the property. The reconstruction algorithm 
gets as input a string x which is close (in L\ norm), to some string y in P. For various types of properties 
P, the above works construct algorithms which give fast query access to locations in y. 

1.2 Folio wup work 

In a recently work, Alon et al. @ further show that the local computation algorithms in this work can be 
modified to not only run in poly logarithmic time but also in poly logarithmic space. Basically, they show 
that all the problems studied in this paper can be solved in a unified local algorithmic framework as follows: 
first the algorithm generates some random bits on a read-only random tape of size at most polylogarifhmic 
in the input, which may be thought as the "core" of a solution y £ F(x); then each bit of the solution y can 
be computed deterministically by first queiying a few bits on the random tape and then performing some 
fast computations on these random bits. The main technical tools used in Q are pseudorandomness, the 
random ordering idea of ll35l and the theory of branching processes. 

1.3 Organization 

The rest of the paper is organized as follows. In Section |2] we present our computation model. Some prelim- 
inaries and notations that we use throughout the paper appear in Section [3] We then give local computation 
algorithms for the maximal independent set problem and the radio network broadcast scheduling problem 
in Section|4]and Section[5l respectively. In Section|6]we show how to use the parallel algorithmic version of 
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the Lovasz Local Lemma to give local computation algorithms for finding the coloring of nodes in a hyper- 
graph. Finally, in Section |7J we show how to find settings of variables according to a satisfying assignment 
of a fc-CNF formula. 

2 Local Computation Algorithms: the model 

We present our model of local computation algorithms for sequential computations of search problems, 
although computations of arbitrary functions and optimization functions also fall within our framework. 

2.1 Model Definition 

We write n = \x\ to denote the length of the input. 

Definition 2.1. For input x, let F(x) = {y \ y is a valid solution for input x}. The search problem is to 
find any y G F(x). 

In this paper, the description of both x and y are assumed to be very large. 

Definition 2.2 ((i, s, <5)-local algorithms). Let x and F(x) be defined as above. A (t(n), s(n), <5(n))-local 
computation algorithm A is a ( randomized) algorithm which implements query access to an arbitrary y € 
F(x) and satisfies the following: A gets a sequence of queries i q for any q > and after each query 

ij it must produce an output yi } satisfying that the outputs y^, . . . , yi q are substrings of some y € F(x). 
The probability of success over all q queries must be at least 1 — 5(n). A has access to a random tape 
and local computation memory on which it can perform current computations as well as store and retrieve 
information from previous computations. We assume that the input x, the local computation tape and any 
random bits used are all presented in the RAM word model, i.e., A is given the ability to access a word of 
any of these in one step. The running time of A on any query is at most t(n), which is sublinear in n, and the 
size of the local computation memory of A is at most s(n). Unless stated otherwise, we always assume that 
the error parameter 5(n) is at most some constant, say, 1/3. We say that A is a strongly local computation 
algorithm if both t(n) and s(n) are upper bounded by log c nfor some constant c. 

Definition 2.3. Let SLC be the class of problems that have strongly local computation algorithms. 

Note that when > 1, the y according to which A outputs may depend on the previous queries to A 

as well as any random bits available to A. Also, we implicitly assume that the size of the output y is upper- 
bounded by some polynomial in The definition of local-computation algorithms rules out the possibility 
that the algorithms accomplish their computation by first computing the entire output. Analogous definitions 
can be made for a bit model. In principle, the model applies to general computations, including function 
computations, search problems and optimization problems of any type of object, and in particular, the input 
is not required by the model to be in a specific input format. 

The model presented here is intended be more general, and thus differs from other local computation 
models in the following ways. First, queries and processing time have the same cost. Second, the focus 
is on problems with slightly looser running time bound requirements - polylogarithmic dependence on the 
length of the input is desirable, but sublinear time in the length of the input is often nontrivial and can be 
acceptable. Third, the model places no restriction on the ability of the algorithm to access the input, as is the 
case in the distributed setting where the algorithm may only query nodes in its neighborhood (although such 
restrictions may be implied by the representation of the input). As such, the model may be less appropriate 
for certain distributed algorithms applications. 
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Definition 2.4 (Query oblivious). We say an LCA A is query order oblivious (query oblivious for short) if 
the outputs of A do not depend on the order of the queries but depend only on the input and the random bits 
generated by the random tape of A. 

Definition 2.5 (Parallelizable). We say an LCA A is parallelizable if A supports parallel queries. 

We remark that not all local algorithms in this paper are query oblivious or easily parallelizable. How- 
ever, this is remedied in 0. 

2.2 Relationship with other distributed and parallel models 

A question that immediately arises is to characterize the problems to which the local-computation algorithm 
model applies. In this subsection, we note the relationship between problems solvable with local computa- 
tion algorithms and those solvable with fast parallel or distributed algorithms. 

From the work of ll36l it follows that problems computable by fast distributed algorithms also have local 
computation algorithms. 

Fact 2.6 ([36]). If F is computable in t(n) rounds on a distributed network in which the processor intercon- 
nection graph has bounded degree d, then F has a d*( n ) -local computation algorithm. 

Parnas and Ron ll36l show this fact by observing that for any vertex v, if we run a distributed algorithm 
A on the subgraph Gk fV (the vertices of distance at most k from v), then it makes the same decision about 
vertex v as it would if we would run D for k rounds on the whole graph G. They then give a reduction from 
randomized distributed algorithms to sublinear algorithms based on this observation. 

Similar relationships hold in other distributed and parallel models, in particular, for problems com- 
putable by low depth bounded fan-in circuits. 

Fact 2.7. If F is computable by a circuit family of depth t{n) and fan-in bounded by d(n), then F has a 
dijifv 1 ' -local computation algorithm. 

Corollary 2.8. NC° C SLC. 

In this paper we show solutions to several problems NC l via local computation algorithms. However, 
this is not possible in general as: 

Proposition 2.9. NC 1 £ SLC. 

Proof. Consider the problem n-XOR, the XOR of n inputs. This problem is in NC 1 . However, no sublinear 
time algorithm can solve n-XOR because it is necessary to read all n inputs. □ 

In this paper, we give techniques which allow one to construct local computation algorithms based on 
algorithms for finding certain combinatorial structures whose existence is guaranteed by constructive proofs 
of the LLL in lfTTl l2l. It seems that our techniques do not extend to all such problems. An example of such a 
problem is Even cycles in a balanced digraph: find an even cycle in a digraph whose maximum in-degree is 
not much greater that the minimum out-degree. Alon [2] shows that, under certain restriction on the input 
parameters, the problem is in NC 1 . The local analogue of this question is to determine whether a given 
edge (or vertex) is part of an even cycle in such a graph. It is not known how to solve this quickly. 
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2.3 Locality-preserving reductions 



In order to understand better which problems can be solved locally, we define locality-preserving reductions, 
which capture the idea that if problem B is locally computable, and problem A has a locality-preserving 
reduction to B then A is also locally computable. 

Definition 2.10. We say that A is (t'(n), s'(n))-locality-preserving reducible to B via reduction H : S* — > 
T*, where E and T are the alphabets of A and B respectively, if H satisfies: 

1. x € A ^> H{x) € B. 

2. H is (t'(n), s'(n),0)-locally computable; that is, every word of H(x) can be computed by querying 
at most t(n) words of x. 

Theorem 2.11. If A is (t'(n), s'(n),0) -locality-preserving reducible to B and B is (t(n), s(n), 5(n))- 
locally computable, then A is (t(n) ■ t'(n), s(n) + s' '(n), 5 (n)) -locally computable. 

Proof. As A is (t'(n), s'(n), 0) -locality -preserving reducible to B, to determine whether x 6 A, it suffices 
to determine if H{x) € B. Each word of H(x) can be computed in time t'(n) and using space s'(n), and 
we need to access at most t(n) such words to determine whether H(x) € B. Note that we can reuse the 
space for computing H(x). □ 



3 Preliminaries 



Unless stated otherwise, all logarithms in this paper are to the base 2. Let N = {0, 1, . . .} denote the set of 
natural numbers. Let n > 1 be a natural number. We use [n] to denote the set {1, . . . , n}. 

Unless stated otherwise, all graphs are undirected. Let G = (V, E) be a graph. The distance between 
two vertices u and v in V(G), denoted by dc{u, v), is the length of a shortest path between the two vertices. 
We write Nc{v) = {u € V(G) : (u, v) 6 E(G)} to denote the neighboring vertices of v. Furthermore, let 
Nq(v) = N(v) U {v}. Let dc(v) denote the degree of a vertex v. Whenever there is no risk of confusion, 
we omit the subscript G from dc{u, v), da(v) and Nq(v). 

The celebrated Lovasz Local Lemma plays an important role in our results. We will use the simple 
symmetric version of the lemma. 

Lemma 3.1 (Lovasz Local Lemma 1151 ). Let A\, A2, ■ ■ ■ , A n be events in an arbitrary probability space. 
Suppose that the probability of each of these n events is at most p, and suppose that each event A,- L is mutually 
independent of all but at most d of other events Aj. Ifep(d + 1) < 1, then with positive probability none of 
the events Ai holds, i.e., 

Pr[n" =1 i 4 ] > 0. 

Several of our proofs use the following graph theoretic structure: 

Definition 3.2 ( ifTTTD . Let G = (V, E) be an undirected graph. Define W C V(G) to be a 3-tree if the 
pairwise distances of all vertices in W are each at least 3 and the graph G* = (W, E*) is connected, where 
E* is the set of edges between each pair of vertices whose distance is exactly 3 in G. 
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4 Maximal Independent Set 



An independent set (IS) of a graph G is a subset of vertices such that no two vertices are adjacent. An 
independent set is called a maximal independent set (MIS) if it is not properly contained in any other IS. 
It is well-known that a sequential greedy algorithm finds an MIS S in linear time: Order the vertices in G 
as 1, 2, • • • , n and initialize S to the empty set; for i = 1 to n, if vertex i is not adjacent to any vertex in 
S, add i to S. The MIS obtained by this algorithm is call the lexicographically first maximal independent 
set (LFMIS). Cook |[T4l showed that deciding if vertex n is in the LFMIS is P-complete with respect to 
logspace reducibility. On the other hand, fast randomized parallel algorithms for MIS were discovered in 
1980's |[20l[29l [3l. The best known distributed algorithm for MIS runs in 0(log* n) rounds with a word-size 
of O(logra) ifToll . By Fact 12. 61 this implies a d°( log n local computation algorithm. In this section, we 
give a query oblivious and parallelizable local computation algorithm for MIS based on Luby's algorithm 
as well as the techniques of Beck ifTTI . and runs in time 0(d 0( - dlogd ^ ■ log n). 

Our local computation algorithm is partly inspired by the work of Marko and Ron QUI . There they 
simulate the distributed algorithm for MIS of Luby ||29l in order to approximate the minimum number of 
edges one need to remove to make the input graph free of some fixed graph H. In addition, they show that 
similar algorithm can also approximate the size of a minimum vertex cover. We simulate Luby's algorithm 
to find an exact and consistent local solution for the MIS problem. Moreover, the ingredient of applying 
Beck's idea to run a second stage greedy algorithm on disconnected subgraphs seems to be new. 

4.1 Overview of the algorithm 

Let G be an undirected graph on n vertices and with maximum degree d. On input a vertex v, our algorithm 
decides whether v is in a maximal independent set using two phases. In Phase 1, we simulate Luby's parallel 
algorithm for MIS ll29l via the reduction of (36]. That is, in each round, v tries to put itself into the IS with 
some small probability. It succeeds if none of its neighbors also tries to do the same. We run our Phase 1 
algorithm for 0(dlog d) rounds. As it turns out, after Phase 1, most vertices have been either added to the 
IS or removed from the graph due to one (or more) of their neighbors being in the IS. Our key observation is 
that - following a valiant of the argument of Beck ifTTTl - almost surely, all the connected components of the 
surviving vertices after Phase 1 have size at most 0(log n). This enables us to perform the greedy algorithm 
for the connected component v lies in. 

Our main result in this section is the following. 

Theorem 4.1. Let G be an undirected graph with n vertices and maximum degree d. Then there is a 
(0(<i ( dlogd ) • logn), 0(n), \jn)-local computation algorithm which, on input a vertex v, decides ifv is in 
a maximal independent set. Moreover, the algorithm will give a consistent MIS for every vertex in G. 

4.2 Phase 1: simulating Luby's parallel algorithm 

Figure [TJillustrates Phase 1 of our local computation algorithm for Maximal Independent Set. Our algorithm 
simulates Luby's algorithm for r = 0(d log d) rounds. Every vertex v will be in one of three possible states: 

• "selected" — v is in the MIS; 

• "deleted" — one of u's neighbors is selected and v is deleted from the graph; and 

• "_L" — v is not in either of the previous states. 
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Initially, every vertex is in state "_L". Once a vertex becomes "selected" or "deleted" in some round, it 
remains in that state in all the subsequent rounds. 

The subroutine MIS(t> , i) returns the state of a vertex v in round i. In each round, if vertex v is still in 
state "_L", it "chooses" itself to be in the MIS with probability l/2d. At the same time, all its neighboring 
vertices also flip random coins to decide if they should "choose" themselves^ If v is the only vertex in 
N + (v) that is chosen in that round, we add v to the MIS ("select" v) and "delete" all the neighbors of v. 
However, the state of v in round % is determined not only by the random coin tosses of vertices in N + (v) 
but also by these vertices' states in round i — 1. Therefore, to compute MIS(w, i), we need to recursively 
call MIS(ti, i — 1) for every il £ N + (v). H By induction, the total running time of simulating r rounds is 

d O(r) = d O(d\ogd)_ 

If after Phase 1 all vertices are either "selected" or "deleted" and no vertex remains in "_L" state, then 
the resulting independent set is a maximal independent set. In fact, one of the main results in |[29l is that this 
indeed is the case if we run Phase 1 for expected 0(log n) rounds. Our main observation is, after simulating 
Luby's algorithm for only 0(d log d) (a constant independent of the size n) rounds we are already not far 
from a maximal independent set. Specifically, if vertex v returns "_L" after Phase 1 of the algorithm, we call 
it a surviving vertex. Now consider the subgraph induced on the surviving vertices. Following a variant of 
Beck's argument ifTTTl . we show that, almost surely, no connected component of surviving vertices is larger 
than poly((i) • logn. 

Let A v be the event that vertex v is a surviving vertex. Note that event A v depends on the random coin 
tosses v and u's neighborhood of radius r made during the first r rounds, where r = O(dlogd). To get 
rid of the complication caused by this dependency, we consider another set of events generated by a related 
random process. 

Consider a valiant of our algorithm MIS, which we call MIS# as shown in Fig [2] In MIS#, every 
vertex v has two possible states: "picked" and "_L". Initially, every vertex is in state "_L". Once a vertex 
becomes "picked" in some round, it remains in the "picked" state in all the subsequent rounds. MIS b and 
MIS are identical except that, in MIS#, if in some round a vertex v is the only vertex in N + (v) that is 
chosen in that round, the state of v becomes "picked", but we do not change the states of v 's neighboring 
vertices. In the following rounds, v keeps flipping coins and tries to choose itself. Moreover, we assume 
that, for any vertex v and in every round, the randomness used in MIS and MIS b are identical as long as 
v has not been "selected" or "deleted" in MIS. If v is "selected" or "deleted" in MIS, then we flip some 
additional coins for v in the subsequent rounds to run MIS^. We let B v be the event that v is in state "_L" 
after running MIS b for r rounds (that is, v is never get picked during all r rounds of MIS b)- 

Claim 4.2. A v C B v for every vertex v 

Proof. This follows from the facts that a necessary condition for A v to happen is v never get "selected" in 
any of the r rounds and deleting the neighbors of v from the graph can never decrease the probability that v 
gets "selected" in any round. Specifically, we will show that B v C A v . 

Note that B v = VJ r i=l B v , where B v is the event that v is picked for the first time in round i in MIS# 
(v may get picked again in some subsequent rounds). Similarly, A v = L)^ =1 A V , where A v is the event 

2 We store all the randomness generated by each vertex in each round so that our answers will be consistent. However, we 
generate the random bits only when the state of corresponding vertex in that round is requested. 

3 A subtle point in the subroutine MIS(u, i) is that when we call MIS(u, i), we only check if vertex v is "selected" or not in 
round i. If v is "deleted" in round i, we will not detect this until we call MIS(w, i + 1), which checks if some neighboring vertex 
of v is selected in round i. However, such "delayed" decisions will not affect our analysis of the algorithm. 

Strictly speaking, the probability spaces in which A v and B v live are different. Here the claim holds for any fixed outcomes of 
all the additional random coins MIS b flips. 
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Maximal Independent Set: Phase 1 
Input: a graph G and a vertex v e V 
Output: {"true", "false", "_L"} 
For i from 1 to r = 20d log d 

(a) If MIS(v, i) = "selected" 
return "true" 

(b) Else if MIS(u, i) = "deleted" 
return "false" 

(c) Else 
return "_L" 

MIS(u,i) 

Input: a vertex v 6 V and a round number i 
Output: {"selected", "deleted", "_L"} 

1 . If v is marked "selected" or "deleted" 

return "selected" or "deleted", respectively 

2. For every u in N(v) 

IfMIS(«,i- 1) = "selected" 

mark v as " deleted" and return "deleted" 

3. v chooses itself independently with probability A 

If v chooses itself 

(i) For every u in N(v) 

If u is marked "_L", u chooses itself independently with probability i 

(ii) If v has a chosen neighbor 
return "_L" 

(iii) Else 

mark v as "selected" and return "selected" 

Else 

return "_L" 



Figure 1 : Local Computation Algorithm for MIS : Phase 1 



MISb(w,») 

Input: a vertex v £ V and a round number i 
Output: {"picked", "_L"} 

1 . If v is marked "picked" 

return "picked" 

2. v chooses itself independently with probability i 

If v chooses itself 

(i) For every u in N(v) 

u chooses itself independently with probability A 

(ii) If v has a chosen neighbor 
return "_L" 

(iii) Else 

mark v as "picked" and return "picked" 

Else 

return "_L" 



Figure 2: Algorithm MIS B 
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that v is selected or deleted in round i in MIS. Hence, we can write, for every i, A v i] = Sell U Del^, 
where Sell and Dell are the events that, in MIS, v gets selected in round i and v get deleted in round i, 
respectively. 

We prove by induction on i that U*- =1 B„ C \J j= {A^\ This is clearly true for i = 1 as By = Se\£\ 
Assume it holds for all smaller values of i. Consider any fixed random coin tosses of all the vertices in the 
graph before round i such that v is not "selected" or "deleted" before round i. Then by induction hypothesis, 
v is not picked in MIS b before round i either. Let N^' (v) be the set of neighboring nodes of v that are in 
state "_L" in round i in algorithm MIS B . Clearly, N^(v) C N(v). 

Now for the random coins tossed in round i, we have 

r{\ 

B v = {v chooses itself in round i} C\ we .N(v) {w does not chooses itself in round i} 
C {v chooses itself in round i} ^ we N^(v) i w ^ oes not chooses itself in round i) 
= Sell°. 

Therefore, B% C Sell C A„ . This finishes the inductive step and thus completes the proof of the 
claim. □ 

As a simple corollary, we immediately have 

Corollary 4.3. For any vertex set W C V(G), Pr[r\ veW A v ] < Pr[n„ e v^-B„]. 

A graph H on the vertices V(G) is called a dependency graph for {B v } v€ ytQ\ if for all v the event B v 
is mutually independent of all B u such that (it, v) £ H. 

Claim 4.4. The dependency graph H has maximum degree d 2 . 

Proof. Since for every vertex v, B v depends only on the coin tosses of v and vertices in N(v) in each of the 
r rounds, the event B v is independent of all A u such that du{u, v) > 3. The claim follows as there are at 
most d? vertices at distance 1 or 2 from v. □ 

Claim 4.5. For every v G V, the probability that B v occurs is at most 1 /8d s . 

Proof. The probability that vertex v is chosen in round i is ^. The probability that none of its neighbors is 
chosen in this round is (1 - ±) d ^ > (1 - ±) d > 1/2. Since the coin tosses of v and vertices in N(v) are 
independent, the probability that v is selected in round i is at least A • | = i. We get that the probability 
that B v happens is at most (1 - ^) 2Mlogd < g^. □ 

Now we are ready to prove the main lemma for our local computation algorithm for MIS. 

Lemma 4.6. After Phase 1, with probability at least 1 — 1/n, all connected components of the surviving 
vertices are of size at most 0(poly(d) • log n). 

Proof. Note that we may upper bound the probability that all vertices in W are surviving vertices by the 
probability that all the events {B v } vG yv happen simultaneously: 

Pr[all vertices in W are surviving vertices] 

= Pr[n veW A v ] 
< Pr[n veW B v }. 
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The rest of the proof is similar to that of Beck lITTI . We bound the number of 3-trees in H (the dependency 
graph for events {B v }) of 3-trees of size w as follows. 

Let H 3 denote the "distance-3" graph of H, that is, vertices u and v are connected in H 3 if their distance 
in H is exactly 3. We claim that, for any integer w > 0, the total number of 3-trees of size w in H 3 is at 
most n(4d 3 ) w . To see this, first note that the number of non-isomoiphic trees on w vertices is at most 4 W 
(see e.g. |[28l ). Now fix one such tree and denote it by T. Label the vertices of Tby v±, i>2, • • • , v w in a way 
such that for any j > 1, vertex Vj is adjacent to some Vi with i < j in T. How many ways are there to 
choose v\, V2, ■ ■ ■ , v w from V{H) so that they can be the set of vertices in T? There are n choices for v±. 
As H 3 has maximum degree D = d(d — l) 2 < d 3 , therefore there are at most D possible choices for V2- 
and by induction there are at most nD w ~ 1 < nd 3w possible vertex combinations for T. Since there can be 
at most 4 W different T's, it follows that there are at most n(4d 3 ) w possible 3-trees in G. 

Since all vertices in W are at least 3-apart, all the events {B v } ve \y are mutually independent. Therefore 
we may upper bound the probability that all vertices in W are surviving vertices as 

Pr[n veW B v ] 

= n pt[b v ] 



< 



1 x w 



8d 3 

where the last inequality follows from Claim 1431 

Now the expected number of 3-trees of size w is at most 

i3\w ' 1 



n{4d )W [sd 3 ) = n2 ' w - 1/n ' 

for w = c\ log n, where c\ is some constant. By Markov's inequality, with probability at least 1 — 1/n, there 
is no 3-tree of size larger than c\ log n. By a simple variant of the 4-tree Lemma in ifTTTl (that is, instead 
of the "4-tree lemma", we need a "3-tree lemma" here), we see that a connected component of size s in H 
contains a 3-tree of size at least s/d 3 . Therefore, with probability at least 1 — 1/n, there is no connected 
surviving vertices of size at least 0(poly(d) • log n) at the end of Phase 1 of our algorithm. □ 



4.3 Phase 2: Greedy search in the connected component 

If v is a surviving vertex after Phase 1, we need to perform Phase 2 of the algorithm. In this phase, we 
first explore v's connected component, C(v ), in the graph induced on G by all the vertices in state "_L". If 
the size of C(v) is larger than C2 log n for some constant C2 which depends only on d, we abort and output 
"Fail". Otherwise, we perform the simple greedy algorithm described at the beginning of this section to find 
the MIS in C(v). To check if any single vertex is in state "_L", we simply run our Phase 1 algorithm on this 
vertex and the running time is J°( rflo s rf ) for each vertex in C(v). Therefore the running time for Phase 2 is 
at most 0(\C(v)\) • d°( dlo s d ) < 0(d 0{dlo&(V > -logn). As this dominates the running time of Phase 1, it is 
also the total running time of our local computation algorithm for MIS. 

Finally, as we only need to store the random bits generated by each vertex during Phase 1 of the algorithm 
and bookkeep the vertices in the connected component during Phase 2 (which uses at most 0(log n) space), 
the space complexity of the local computation algorithm is therefore 0(n). 
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5 Radio Networks 



For the purposes of this section, a radio network is an undirected graph G = (V, E) with one processor 
at each vertex. The processors communicate with each other by transmitting messages in a synchronous 
fashion to their neighbors. In each round, a processor P can either receive a message, send messages to all 
of its neighbors, or do nothing. We will focus on the radio network that is referred to as a Type II network^ 
in ED. P receives a message from its neighbor Q if P is silent, and Q is the only neighbor of P that transmits 
in that round. Our goal is to check whether there is a two-way connection between each pair of adjacent 
vertices. To reach this goal, we would like to find a schedule such that each vertex in G broadcasts in one of 
the K rounds and K is as small as possible. 

Definition 5.1 (Broadcast function). Let G = (V, E) be an undirected graph. We say F r : V — > [K] is a 
broadcast function for the network G if the following holds: 

1. Every vertex v broadcasts once and only once in round F r (v) to all its neighboring vertices; 

2. No vertex receives broadcast messages from more than one neighbor in any round; 

3. For every edge (it, v) € G, u and v broadcast in distinct rounds. 

Let A be the maximum degree of G. Alon et. al. H |5l show that the minimum number of rounds K 
satisfies K = O(AlogA). Furthermore, Alon [Q gives an NC\ algorithm that computes the broadcast 
function F r with K = 0(A log A). Here we give a local computation algorithm for this problem, i.e. given 
a degree-bounded graph G = (V, E) in the adjacency list form and a vertex v € V, we output the round 
number in which v broadcasts in logarithmic time. Our solution is consistent in the sense that all answers 
our algorithm outputs to the various v G V agree with some broadcast scheduling function F r . 

Let G 1,2 be the "square graph" of G; that is, u and v are connected in G 1 ' 2 if and only if their distance in 
G is either one or two. Our algorithm is based on finding an independent set cover of G 1,2 which simulates 
Luby's Maximal Independent Set algorithm [29). Note that if we denote the maximum degree of G 1,2 by d, 
then d < A 2 . 

Definition 5.2 (Independent Set Cover). Let H = (V, E) be an undirected graph. A collection of vertex 
subsets {S\, . . . , St} is an independent set cover (ISC) for H if these vertex sets are pairwise disjoint, each 
Si is an independent set in H and their union equals V. We call t the size of ISC {S\, . . . , St}. 

Fact 5.3. If {S±, . . . , St} is an ISC for G 1 ' 2 , then the function defined by F r (v) = i iff v € Si is a broadcast 
function. 

Proof. First note that, since the union of {Si} equals V, F r (v) is well-defined for every v € G. That is, 
every v broadcasts in some round in [t], hence both directions of every edge are covered in some round. As 
v can only be in one IS, it only broadcasts once. Second, for any two vertices u and v, if d(u, v) > 3, 
then N(u) n N(v) = 0. It follows that, if in each round all the vertices that broadcast are at least 3-apart 
from each other, no vertex will receive more than one message in any round. Clearly the vertices in an 
independent set of G 1,2 have the property that all the pairwise distances are at least 3. □ 

The following is a simple fact about ISCs. 

5 The other model, Type I radio network, is more restrictive: A processor P receives a message from its neighbor Q in a given 
round only if P is silent, Q transmits and P chooses to receive from Q in that round. 
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Fact 5.4. For every undirected graph H on n vertices with maximum degree d, there is an ISC of size at 
most d. Moreover, such an ISC can be found by a greedy algorithm in time at most 0(dn). 

Proof. We repeatedly apply the greedy algorithm that finds an MIS in order to find an ISC. Recall that the 
greedy algorithm repeats the following until the graph has no unmarked vertex: pick an unmarked vertex v, 
add it to the IS and mark off all the vertices in N(v). Clearly each IS found by the greedy algorithm has 
size at least gxj. To partition the vertex set into an ISC, we run this greedy algorithm to find an IS which 
we call Si , and delete all the vertices in Si from the graph. Then we run the greedy algorithm on the new 
graph again to get 52, and so on. After running at most d rounds (since each round reduces the maximum 
degree of the graph by at least one), we partition all the vertices into an ISC of size at most d and the total 
running time is at most O(dn). □ 

Our main result in this section is a local computation algorithm that computes an ISC of size 0(dlog d) 
for any graph of maximum degree d. On input a vertex v, our algorithm outputs the index i of a vertex subset 
Si to which v belongs, in an ISC of H. We will call i the round number of v in the ISC. By Fact |5.3[ applying 
this algorithm to graph G 1 ' 2 gives a local computation algorithm that computes a broadcast function for G. 

5.1 A local computation algorithm for ISC 

Our main result for computing an ISC is summarized in the following theorem. 

Theorem 5.5. Let H be an undirected graph on n vertices with maximum degree d. Then there is a (poly (d) ■ 
log n, 0(n), \jn)-local computation algorithm which, on input a vertex v, computes the round number ofv 
in an ISC of size at most 0(dlog d). Moreover, the algorithm will give a consistent ISC for every vertex in 
H. 

On input a vertex v, our algorithm computes the round number of v in two phases. In Phase 1 we 
simulate Luby's algorithm for MIS ||29l for 0(dlogd) rounds. At each round, v tries to put itself in the 
independent set generated in that round. That is, v chooses itself with probability l/2d and if none of its 
neighbors choose themselves, then v is selected in that round and we output that round number for v. As 
we show shortly, after Phase 1 , most vertices will be assigned a round number. We say v survives if it is not 
assigned a round number. We consider the connected component containing v after one deletes all vertices 
that do not survive from the graph. Following an argument similar to that of Beck flTTfl. almost surely, all 
such connected components of surviving vertices after Phase 1 have size at most <3(log n). This enables us, 
in Phase 2, to perform the greedy algorithm on v's connected component to deterministically compute the 
round number of v in time 0(iog n). 

5.1.1 Phase 1 algorithm 

Phase 1 of our local computation algorithm for computing an ISC is shown in Figure [3] 

For every v € V, let A v be the event that vertex v returns "_L", i.e. v is not selected after r rounds. We 
call such a v a surviving vertex. After deleting all v that do not survive from the graph, we are interested in 
bounding the size of the largest remaining connected component. Clearly event A v depends on the random 
coin tosses of v and v's neighboring vertices in all the r rounds. A graph H on the vertices V(H) (the indices 

6 In 2(b), we flip random coins for u even if u is selected in a previous round. We do this for the technical reason that we want 
to rid the dependency of v on nodes that are not neighbors to simplify our analysis. Thus our analysis is overly pessimistic since if 
selected neighbors stop choosing themselves, it only increases the chance of v being selected. 
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Independent Set Cover: Phase 1 

Input: a graph H and a vertex v G V 

Output: the round number of v in the ISC or "_L" 

1. Initialize all vertices in N + (v) to state "_L" 

2. For i = 1 to r = 20d log d 

(a) If wis labeled "_L" 

v chooses itself independently with probability 

(b) If w chooses itself 

(i) For every u € 7V(i>) 

(even if u is labeled "selected in round j" 

for some j < i, we still flip random coins for it) 

u chooses itself independently with probability 

(ii) If v has a chosen neighbor, 
v unchooses itself 

(iii) Else 

v is labeled "selected in round i" 
return i 

3. return "_L" 



Figure 3: Algorithm for finding an Independent Set Cover: Phase 1. 

for the A v ) is called a dependency graph for {A v } v& y^ if for all v the event A v is mutually independent 
of all A u with (it, v) H. 

The following two claims are identical to Claim l4~4l and Claim l431 in Section[4]respectively, we therefore 
omit the proofs. 

Claim 5.6. The dependency graph H has maximum degree d 2 . 

Claim 5.7. For every v 6 V, the probability that A v occurs is at most 1 / 8c? 3 . 

The following observation is crucial in our local computation algorithm. 

Lemma 5.8. After Phase 1, with probability at least 1 — 1 jn, all connected components of the surviving 
vertices are of size at most 0(poly(d) • log n). 

Proof. The proof is almost identical to that of Lemma 1431 but is only simpler: we can directly upper bound 
the probability 

Pr[all vertices in W are surviving vertices] = Pr[r\, e vy^4„] 
by way of Beck ifTlTl without resorting to any other random process. We omit the proof. □ 

5.1.2 Phase 2 algorithm 

If v is a surviving vertex after Phase 1 , we perform Phase 2 of the algorithm. In this phase, we first explore 
the connected component, C(v), that the surviving vertex v lies in. If the size of C(v) is larger than C2 log n 
for some constant 02(d) depending only on d, we abort and output "Fail". Otherwise, we perform the simple 
greedy algorithm described in Fact |5.4| to partition C(v ) into at most d subsets deterministically. The running 
time for Phase 2 is at most poly(o!) • logn. Since any independent set of a connected component can be 
combined with independent sets of other connected components to form an IS for the surviving vertices, we 
conclude that the total size of ISC we find is 0(dlog d) + d = 0(dlog d). 
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5.2 Discussions 



Now a simple application of Theorem 15.51 to G 1,2 gives a local computation algorithm for the broadcast 
function. 

Theorem 5.9. Given a graph G = (V, E) with n vertices and maximum degree A and a vertex v £ V, there 
exists a (poly(A) ■ logn, 0(n), \/n)-local computation algorithm that computes a broadcast function with 
at most 0(A 2 log A) rounds. Furthermore, the broadcast function it outputs is consistent for all queries to 
the vertices of the graph. 

We note our round number bound is quadratically larger than that of Alon's parallel algorithm We 
do not know how to turn his algorithm into a local computation algorithm. 

6 Hypergraph two-coloring 

A hypergraph H is a pair H = (V, E) where V is a finite set whose elements are called nodes or vertices, 
and E is a family of non-empty subsets of V, called hyperedges. A hypergraph is called k-uniform if 
each of its hyperedges contains precisely k vertices. A two-coloring of a hypergraph H is a mapping 
c : V — > {red, blue} such that no hyperedge in E is monochromatic. If such a coloring exists, then we say 
H is two-colorable. We assume that each hyperedge in H intersects at most d other hyperedges. Let N be 
the number of hyperedges in H. Here we think of k and d as fixed constants and all asymptotic forms are 
with respect to N. By the Lovasz Local Lemma, when e(d+ 1) < 2 fe_1 , the hypergraph H is two-colorable. 

Let m be the total number of vertices in H. Note that m < kN, so m = O(N). For any vertex x € V, 
we use £(x) to denote the set of hyperedges x belongs to. For convenience, for any hypergraph H = (V, E), 
we define an m-by-iV vertex-hyperedge incidence matrix A4 such that, for any vertex x and hyperedge e, 
M x ,e = 1 if e G £(%) and M x ,e = otherwise. A natural representation of the input hypergraph H is this 
vertex-hyperedge incidence matrix M.. Moreover, since we assume both k and d are constants, the incidence 
matrix M. is necessarily very sparse. Therefore, we further assume that the matrix M. is implemented via 
linked lists for each row (that is, vertex x) and each column (that is, hyperedge e). 

Let G be the dependency graph of the hyperedges in H. That is, the vertices of the undirected graph G 
are the N hyperedges of H and a hyperedge Ei is connected to another hyperedge Ej in G if Ei D Ej ^ 0. 
It is easy to see that if the input hypergraph is given in the above described representation, then we can find 
all the neighbors of any hyperedge Ei in the dependency graph G (there are at most d of them) in 0(log N) 
time. 

6.1 Our main result 

A natural question to ask is: Given a two-colorable hypergraph H and a vertex v € V(H), can we quickly 
compute the coloring of v? Here we would like the coloring to be consistent, meaning all the answers we pro- 
vide must come from the same valid two-coloring. Our main result in this section is, given a two-colorable 
hypergraph H whose two-coloring scheme is guaranteed by the Lovasz Local Lemma (with slightly weaker 
parameters), we give a local computation algorithm which answers queries of the coloring of any single 
vertex in polylogiV time, where N is the number of the hyperedges in H. The coloring returned by our 
oracle will agree with some two-coloring of the hypergraph with probability at least 1 — 1/N. 
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Theorem 6.1. Let d and k be such that there exist three positive integers k\, &2 and ks such that the fallow- 
ings hold: 



k\ + k 2 + k 3 = k, 



16d(d 
16d(d 



- lf(d + l) < 2 k \ 
-l) 3 (d+l) <2 k \ 
2e(d + l) < 2 k \ 



Then there exists a (polylogiV, O(N), 1 /N)-local computation algorithm which, given a hypergraph H and 
any sequence of queries to the colors of vertices (x\, X2, ■ ■ ■ ,x s ), returns a consistent coloring for all x% 's 
which agrees with some 2-coloring of H. 

6.2 Overview of the coloring algorithm 

Our local computation algorithm imitates the parallel coloring algorithm of Alon J2]. Recall that Alon's 
algorithm runs in three phases. In the first phase, we randomly color each vertex in the hypergraph following 
some arbitrary ordering of the vertices. If some hyperedge has k\ vertices in one color and no vertices in the 
other color, we call it a dangerous edge and mark all the remaining vertices in that hyperedge as troubled. 
These troubled vertices will not be colored in the first phase. If the queried vertex becomes a troubled vertex 
from the coloring process of some previously queried vertex, then we run the Phase 2 coloring algorithm. 
There we first delete all hyperedges which have been assigned both colors and call the remaining hyperedges 
surviving edges. Then we repeat the same process again for the surviving hyperedges, but this time a 
hyperedge becomes dangerous if k\ + k% vertices are colored the same color and no vertices are colored by 
the other color. Finally, in the third phase, we do a brute-force search for a coloring in each of the connected 
components of the surviving vertices as they are of size 0(log log N) almost surely. 

A useful observation is, in the first phase of Alon's algorithm, we can color the vertices in arbitrary 
order. In particular, this order can be taken to be the order that queries to the local computation algorithm 
are made in. If the coloring of a vertex x can not be determined in the first phase, then we explore the 
dependency graph around the hyperedges containing x and find the connected component of the surviving 
hyperedges to perform the second phase coloring. To ensure that all the connected components of surviving 
hyperedges resulting from the second phase coloring are of small sizes, we repeat the second phase colorings 
independently many times until the connected components sizes are small enough. If that still can not decide 
the coloring of x, then we run the third (and final) phase of coloring, in which we exhaustively search for 
a two-coloring for vertices in some very small (i.e., of size at most 0(log log N)) connected component in 
G as guaranteed by our second phase coloring. Following Alon's analysis, we show that with probability at 
least 1 — 1 /TV, the total running time of all these three phases for any vertex in H is polylog/V. 

During the execution of the algorithm, each hyperedge will be in either initial, safe, unsafe-1, unsafe-2, 
dangerous-1 or dangerous-2 state. Vertices will be in either uncolored, red, blue, trouble-! or trouble-2 
state. The meanings of all these states should be clear from their names. Initially every hyperedge is in 
initial state and every vertex is in uncolored state. 

6.3 Phase 1 coloring 

If x is already colored (that is, x is in either red or blue state), then we simply return that color. If x is 
in the trouble-! state, we invoke Phase 2 coloring for vertex x. If x is in the trouble-2 state, we invoke 
Phase 3 coloring for vertex x. If x is uncolored, then we flip a fair coin to color x red or blue with equal 



16 



Phase 1 Coloring(x) 




Input: a vertex x <E V 




Output: a color in { red, blue } 




1 . If a; is already colored 




Return the color of x 




2. If x is in trouble-1 state 




Return Phase 2 Coloring(x) 




3. If x is in trouble-2 state 




Return Phase 3 Coloring(x) 




4. If x is in uncolored state 




(a) Uniformly at random choose a color c 


for x from { red, blue } 


(b) Update the states of all hyperedges in 


S(x) 


(c) Return color c 





Figure 4: Phase 1 coloring algorithm 



probability (that is, vertex x's state becomes red or blue, respectively). After that, we update the status of 
all the hyperedges in £(x). Specifically, if some Ei € £(x) has k\ vertices in one color and no vertices in 
the other color, then we change Ei from initial into dangerous-1 state. Furthermore, all uncolored vertices 
in Ei will be changed to trouble-1 states. On the other hand, if both colors appear among the vertices of 
Ei, we update the state of Ei from initial to safe. If none of the vertices in a hyperedge is uncolored and 
the hyperedge is still in initial state (that is, it is neither safe or dangerous-1), then we change its state to 
unsafe-1. Note that if a hyperedge is unsafe-1 then all of its vertices are either colored or in trouble-1 state, 
and the colored vertices are monochromatic. 

Running time analysis. The running time of Phase 1 coloring for an uncolored vertex x is 0(kd) = 0(1) 
(recall that we assume both k and d are constants). This is because vertex x can belong to at most d + 1 
hyperedges, hence there are at most k(d+l) vertices that need to be updated during Phase 1. If a; is already a 
colored vertex, the running time is clearly 0(1). Finally, the running time of Phase 1 coloring for a trouble-1 
or trouble-2 vertex is O(l) plus the running time of Phase 2 coloring or 0(1) plus the running time of Phase 
3 coloring, respectively. 

6.4 Phase 2 coloring 

During the second phase of coloring, given an input vertex x (which is necessarily a trouble-1), we first 
explore the dependency graph G of the hypergraph H by keep coloring some other vertices whose colors 
may have some correlation with the coloring of x. In doing so, we grow a connected component of surviving- 
1 hyperedges containing x in G. Here, a hyperedge is called surviving-1 if it is either dangerous-1 or 
unsafe-1. We denote this connected component of surviving-1 hyperedges surrounding vertex x by C\{x). 
Growing the connected component. Specifically, in order to find out C\(x), we maintain a set of hyper- 
edges £\ and a set of vertices V\ . Throughout the process of exploring G, V\ is the set of uncolored vertices 
that are contained in some hyperedge in £\. Initially £\ = £(x). Then we independently color each vertex 
in V\ red or blue uniformly at random. After coloring each vertex, we update the state of every hyperedge 
that contains the vertex. That is, if any hyperedge Ei S V\ becomes safe, then we remove Ei from V\ and 
delete all the vertices that are only contained in Ei. On the other hand, once a hyperedge in V\ becomes 
dangerous-2 (it has &2 vertices, all the uncolored vertices in that hyperedge become trouble-2 and we skip 
the coloring of all such vertices. After the coloring of all vertices in V±, hyperedges in £\ are surviving 
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Phase 2 Coloring(a;) 

Input: a trouble-! vertex x G V 
Output: a color in {red, blue] or FAIL 

1. Start from £(x) to explore G in order to find the connected 

components of all the surviving-1 hyperedges around x 

2. If the size of the component is larger than c\ log N 

Abort and return FAIL 

3. Repeat the following 0( ^°f o ^ N ) times and stop if a good coloring is found 

(a) Color all the vertices in C\ (x) uniformly at random 

(b) Explore the dependency graph of G\s 1 ( x ) 

(c) Check if the coloring is good 

4. Return the color of x in the good coloring 



Figure 5: Phase 2 coloring algorithm 

hyperedges. Then we check all the hyperedges in G that are adjacent to the hyperedges in S\. If any of these 
hyperedges is not in the safe state, then we add it to £ \ and also add all its uncolored vertices to V\ . Now we 
repeat the coloring process described above for these newly added uncolored vertices. This exploration of 
the dependency graph terminates if, either there is no more hyperedge to color, or the number of surviving-1 
hyperedges in £\ is greater than c\ log N, where c\ is some absolute constant. The following Lemma shows 
that, almost surely, the size of C\(x) is at most c\ log N. 

Lemma 6.2 ((21). Let S C G be the set of surviving hyperedges after the first phase. Then with probability 
at least 1 — =^ (over the choices of random coloring), all connected components C\{x) of G\s have sizes 
at most c\ log N. 

Random coloring. Since C\[x) is not connected to any surviving-1 hyperedges in H, we can color the 
vertices in the connected component C\ (x) without considering any other hyperedges that are outside C\ (x). 
Now we follow a similar coloring process as in Phase 1 to color the vertices in C\(x) uniformly at random 
and in an arbitrary ordering. The only difference is, we ignore all the vertices that are already colored red 
or blue, and if k\ + ki vertices in a hyperedge get colored monochromatically, and all the rest of vertices in 
the hyperedge are in trouble-1 state, then this hyperedge will be in dangerous-2 state and all the uncolored 
vertices in it will be in trouble-1 state. Analogously we define unsafe-2 hyperedges as hyperedges whose 
vertices are either colored or in trouble-2 state and all the colored vertices are monochromatic. Finally, we 
say a hyperedge is a surviving-2 edge if it is in either dangerous-2 state or unsafe-2 state. 

Let S\(x) be the set of surviving hyperedges in C\(x) after all vertices in C\(x) are either colored or in 
trouble-2 state. Now we explore the dependency graph of S\{x) to find out all the connected components. 
Another application of Lemma I6T21 to G\s 1 ( x ) shows that with probability at least 1 — 0( 1q ^ 2 N ) (over the 
choices of random coloring), all connected components in G\g l {x) na v e sizes at most C2 log log N, where C2 
is some constant. We say a Phase 2 coloring is good if this condition is satisfied. Now if a random coloring 
is not good, then we erase all the coloring performed during Phase 2 and repeat the above coloring and 
exploring dependency graph process. We keep doing this until we find a good coloring. Therefore, after 
recoloring at most 0( ^f o ^ N ) times (and therefore with at most polylogA^ running time), we can, with 
probability at least 1 — 1/2N 2 , color C\(x) such that each connected component in G\g 1 (x) nas s i ze at most 
C2 log log N. By the union bound, with probability at least 1 — 1/2 N , the Phase 2 colorings for all connected 
components find some good colorings. 
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Phase 3 Coloring^) 

Input: a trouble-2 vertex x € V 
Output: a color in {red, blue] 

1 . Start from £(x) to explore G in order to find the connected 

component of all the surviving-2 hyperedges around x 

2. Go over all possible colorings of the connected component 

and color it using a feasible coloring. 

3. Return the color c of x in this coloring. 



Figure 6: Phase 3 coloring algorithm 

Running time analysis. Combining the analysis above with an argument similar to the running time analysis 
of Phase 1 coloring gives 

Claim 6.3. Phase 2 coloring takes at most polylogiV time. 
6.5 Phase 3 coloring 

In Phase 3, given a vertex x (which is necessarily trouble-2), we grow a connected component which includes 
x as in Phase 2, but of surviving-2 hyperedges. Denote this connected component of surviving-2 hyperedges 
by C2{x). By our Phase 2 coloring, the size of C2(x) is no greater than c 2 log log N. We then color the 
vertices in this connected component by exhaustive search. The existence of such a coloring is guaranteed 
by the Lovasz Local Lemma (Lemma 13. It . 

Claim 6.4. The time complexity of Phase 3 coloring is at most polylogiV. 

Proof. Using the same analysis as for Phase 2, in time 0(log log N) we can explore the dependency graph to 
grow our connected component of surviving-2 hyperedges. Exhaustive search of a valid two-coloring of all 
the vertices in C 2 (x) takes time at most tfHi&W) = 2°( 1 °s 1 °g A = polylogiV, as \C 2 (x)\ < c 2 log log N 
and each hyperedge contains k vertices. □ 

Finally, we remark that using the same techniques as those in 0, we can make our local computation 
algorithm run in parallel and find an ^-coloring of a hypergraph for any £ > 2 (an ^-coloring of a hypergraph 
is to color each vertex in one of the I colors such that each color appears in every hyperedge). 

7 fc-CNF 

As another example, we show our hypergraph coloring algorithm can be easily modified to compute a 
satisfying assignment of a fc-CNF formula, provided that the latter satisfies some specific properties. 

Let if be a /c-CNF formula on m Boolean variables x\, . . . ,x m . Suppose H has N clauses H = 
Ay A • • • A An and each clause consists of exactly k distinct literals^ We say two clauses A, and Aj 
intersect with each other if they share some variable (or the negation of that variable). As in the case for 
hypergraph coloring, k and d are fixed constants and all asymptotics are with respect to the number of 
clauses N (and hence m, since m < kN). Our main result is the following. 

7 Our algorithm works for the case that each clause has at least k literals; for simplicity, we assume that all clauses have uniform 
size. 
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Theorem 7.1. Let H be a k-CNF formula with k > 2. If each clause intersects no more than d other 
clauses and furthermore k and d are such that there exist three positive integers k\ , k 2 and h$ satisfying the 
fallowings relations: 



then there exists a local computation algorithm that, given any sequence of queries to the truth assignments 
of variables (xi,x 2 , ■ ■ ■ ,x s ), with probability at least 1 — 1/N, returns a consistent truth assignment for 
all Xi's which agrees with some satisfying assignment of the k-CNF formula H. Moreover, the algorithm 
answers each single query in 0((log N) c ) time, where c is some constant (depending only on k and d). 

Proof [Sketch]: We follow a similar algorithm to that of hypergraph two-coloring as presented in Section|6] 
Every clause will be in either initial, safe, unsafe-l, unsafe-2, dangerous-l or dangerous-2 state. Every 
variable will be in either unassigned, true-1, false-l, trouble-l or trouble-2 state. Initially every clause is in 
initial state and every variable is in unassigned state. Suppose we are asked about the value of a variable 
Xi. If Xi is in initial state, we randomly choose from {true, false} with equal probabilities and assign it to 
Xi. Then we update all the clauses that contain either Xi or xi accordingly: If the clause is already evaluated 
to true by this assignment of Xi, then we mark the literal as safe; if the clause is in initial state and is 
not safe yet and Xi is the k± literal in the clause that has been assigned values, then the clause is marked 
as dangerous-l and all the remaining unassigned variables in that clause are now in trouble-l state. We 
perform similar operations for clauses in other states as we do for the hypergraph coloring algorithm. The 
only difference is now we have Pr[Ai becomes dangerous-l] = 2~ kl , instead of 2 1_fcl as in the hypergraph 
coloring case. Following the same analysis, almost surely, all connected components in the dependency 
graph of unsafe-l clauses are of size at most 0(log N) and almost surely all connected components in the 
dependency graph of unsafe-2 clauses are of size at most 0(loglog N), which enables us to do exhaustive 
search to find a satisfying assignment. □ 

8 Concluding Remarks and Open Problems 

In this paper we propose a model of local computation algorithms and give some techniques which can 
be applied to construct local computation algorithms with polylogarithmic time and space complexities. It 
would be interesting to understand the scope of problems which can be solved with such algorithms and to 
develop other techniques that would apply in this setting. 
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